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Abstract: Wireless sensor networks (WSNs) suffers from the 
hot spot problem where the sensor nodes closest to the base 
station are need to relay more packet than the nodes farther 
away from the base station. Thus, lifetime of sensory network 
depends on these closest nodes. Clustering methods are used 
to extend the lifetime of a wireless sensor network. However, 
current clustering algorithms usually utilize two techniques; 
selecting cluster heads with more residual energy, and rotating 
cluster heads periodically to distribute the energy consumption 
among nodes in each cluster and lengthen the network lifetime. 
Most of the algorithms use random selection for selecting the 
cluster heads. Here, we propose a Fault Tolerant Trajectory 
Clustering (FTTC) technique for selecting the cluster heads in 
WSNs. Our algorithm selects the cluster heads based on traffic 
and rotates periodically. It provides the first Fault Tolerant 
Trajectory based clustering technique for selecting the cluster 
heads and to extenuate the hot spot problem by prolonging the 
network lifetime. 

Keywords: Fault Tolerant Trajectory Clustering, Trajectory 
clustering. Wireless sensor networks. Network life time. Cluster 
head. 

I. Introduction 

Wireless sensor networks (hereinafter, WSNs) are networks 
of wireless nodes that are deployed over an area for the pur- 
pose of monitoring certain phenomena of interest. To keep 
specific areas under observation, WSNs deploy hundreds or 
thousands of integrated sensor nodes to sample data from 
observed environment. Although, these devices are not very 
accurate and reliable individually, their deployment in large 
number enhances their accuracy and reliability. In addition 
WSNs can provide area coverage that was not possible with 
other wireless networks. They can be deployed in extremely 
hostile environments, such as near volcanically active sites, 
inside a chemical industry, or probable disaster areas or in 
mainly inaccessible environments. 



WSNs have numerous advantages. They are easier and 
faster to deploy than any wired network. They have a large 
coverage area and longer range. They have higher degree 
of fault tolerance than wireless networks, because failure of 
one or more sensors or nodes does not effect the operation 
of the network and mainly they are self-configuring. The 
nodes perform certain measurements, process the measured 
data and transmit the processed data to a base station over 
a wireless channel. The base station collects data from all 
the nodes, and analyzes this data to draw conclusions about 
the activity in the area of interest. In practice, due to the 
large quantity of sensor nodes, it is infeasible to recharge the 
batteries in WSNs. Therefore, sensor network lifetime is a 
primary concern in sensor network design. 

In literature many researchers proposed various protocols 
to reduce the energy consumption and improve the network 
lifetime of WSNs. Those protocols can be categorized into 
three classes: routing protocols, sleep-and-awake scheduling 
protocols, and clustering protocols. The routing proto- 
cols |[T] Q ||3) Q determine the energy-efficient multi-hop 
paths from each node to the base station. In sleep-and-awake 
scheduling protocols I^SJ I^TJ, every node in the schedule 
can sleep, in order to minimize energy consumption. In 
clustering protocols I'S] fig |[TT) fT^ |[13) |[14^ |T5l, data 
aggregation can be used for reducing energy consumption. 
Data aggregation, also known as data fusion, can combine 
multiple data packets received from different sensor nodes. 
It reduces the size of the data packet by eliminating the 
redundancy |16| pT) . Wireless communication cost is 
also decreased by the reduction in the data packets. Hence, 
by reducing the energy consumption clustering protocols 
increases the network lifetime. 

Clustering is a commonly adopted approach in sen- 
sor networks to manage power efficiently. In clustering, 
sensors in the monitoring area are grouped into clusters; 
all sensor nodes within the same cluster send their data 
to the cluster head, which then forwards the aggregated 
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data to the base station. In this way clustering reduces the 
overhead and increases the network Hfetime; on the other 
hand, the disadvantage is due to heavy usage of cluster 
heads "typically cluster heads die at an early stage" |19|. 
This is sometimes called the hot spot problem |20|. Without 
adding extra nodes or redistributing the available energy, 
this problem is hard to solve. For example, Ref. p9| has 
shown that, with varying transmission power of nodes 
and even considering unlimited transmission ranges does 
not solve the hot spot problem. At the same time, it is 
also envisioned that sensor nodes will become "extremely 
inexpensive" pT) . While beyond a certain node density, 
adding additional nodes does not provide any improvement 
regarding sensing, communication or coverage |22|, adding 
nodes might obviously help to increase the lifetime of a 
sensor network while providing the same service to its users, 
i.e. leveraging sensor values from the same number of nodes. 

Ref. [14 1 proposed the randomized clustering algorithm to 
organize sensors into clusters in a wireless sensor network. 
Computation of the optimal probability of becoming a 
cluster head was presented. Ref. |23 1 defined the maximum 
cluster-lifetime problem, and they proposed distributed, ran- 
domized algorithms that approximate the optimal solution to 
maximize the lifetime of dominating sets on wireless sensor 
networks., |24| considered the k-domatic partition problem, 
and they proposed three deterministic, distributed algorithms 
for finding large k-domatic partitions. 

Ref. |[8) proposed LEACH, a well-known clustering protocol 
for wireless sensor networks. LEACH includes distributed 
cluster formation, local processing to reduce global commu- 
nication and randomized rotation of cluster heads among 
all the nodes in the network. Each cluster selects a cluster 
head, which is responsible for aggregating collected data and 
sending data to base station. LEACH provides a good model 
that helped to reduce information overloaded and provides a 
reliable data to the end user. Together, these features allow 
LEACH to achieve the desired properties. Also, an improved 
scheme of LEACH was proposed, named LEACH-C f9] . In 
LEACH-C, a centralized algorithm at the base station makes 
cluster formation. 

Ref. p5) deals with the problem of finding an energy- 
balanced solution to data propagation in WSNs using a 
probabilistic algorithm was considered for the first time. 
The lifespan of the network is maximized by ensuring that 
the energy consumption in each slice is the same. Sensors 
are assumed to be randomly distributed with uniform distri- 
bution in a circular region or, more generally, the sector of a 
disk. Data have to be propagated by the WSN towards a sink 
located at the centre of the disk, and it is shown that energy 
balance can be achieved if a recurrence relation between 
the probabilities that a slice ejects a message to the sink is 
satisfied. 

Ref. ITO) proposed PEGASIS. PEGASIS makes a commu- 
nication chain using a Traveling Sales Person heuristic. In 
PEGASIS, nodes are organized into a chain using a greedy 
algorithm so that each node transmits to and receives from 



one of its neighbors. A randomly selected node from the 
chain will forward the aggregated data to the base station, 
thereby reducing per round energy expenditure compared to 
LEACH. 

Ref. 1 26 1 proposed clustering-based routing protocol 
called base station controlled dynamic clustering protocol 
(BCDCP), which utilizes a high energy base station to set up 
cluster heads and perform other energy-intensive tasks, can 
noticeably enhance the lifetime of a network. 

Ref. 1 27 1 proposed two new algorithms under the name of 
PEDAP, which are near optimal minimum spanning tree 
based wireless routing schemes. The performance of the 
PEDAP was compared with LEACH and PEGASIS, and 
showed a slightly better network lifetime than PEGASIS. 
Ref. 1 28 1 proposed a new routing scheme; called SHORT, to 
achieve higher energy efficiency, network lifetime, and more 
throughput than PEGASIS, and PEDAP-PA protocols. This 
scheme used the centralized algorithms and required the 
powerful base station. The performance results showed that 
SHORT can achieve better "energy X delay" performance 
than the existing chain based data aggregation protocols. 

Ref. |11| proposed HEED, by extending LEACH and 
considering range limits of the wireless communication 
and intra-cluster communication cost. The probability for 
each node to become a tentative cluster head depends on 
its residual energy, and all the tentative heads in which are 
competing for becoming the final cluster heads. The final 
cluster heads are selected according to the intra-cluster 
communication cost. HEED terminates within a constant 
number of iterations, and achieves fairly uniform distribution 
of cluster heads across the network. 

Ref. p9| proposed EECR, which is an energy efficient 
clustering routing algorithm. The performance of the EECR 
was compared with LEACH, and showed a slightly better 
network hfetime than LEACH. 

However, the unsolved problem of considerable energy con- 
sumption on the cluster formation still exists. Here, we con- 
sider the path followed by the node/sensor to transfer data to 
the base station as the "trajectory", and using the proposed 
trajectory clustering pO) [31) , we cluster the trajectories and 
obtain the "representative trajectory". Then the nodes lies 
in the representative trajectory are considered as the cluster 
heads and the obtained cluster heads will be used for commu- 
nicating data to the base station. Moreover, we concentrated 
on the rotation of cluster heads among all sensor nodes to im- 
prove the lifetime of the network based on the traffic density. 
We tested our proposed method and found that this method 
considerably enhances the hfetime of the network. 

11. Novel Algorithm 

This section considers the wireless sensor networks consist- 
ing of hundreds or thousands of deployed sensor nodes in 
the sensing field. On the basis of | 10J [26 J , it is assumed by 
the following properties of the wireless sensor networks to 
simplify the network model. 



• The base station is located far away from the sensors, 

• The nodes have uniform initial energy allocation and all 
sensor nodes have equal capabilities (data processing, 
wireless communication, battery power). 

• AU sensor nodes have various transmission power lev- 
els, and each node can change the power level dynami- 
cally. 

• Each node senses the environment at a fixed rate, and 

• AU nodes are immobile. 

The sensor nodes are geographically grouped into clusters 
and capable of operating in two basic modes: the sensing 
mode and the cluster head mode |10|. In the sensing mode, 
the node senses the task and sends the sensed data to its clus- 
ter head. In cluster head mode, a node gathers data from its 
cluster members, performs data fusion, and transmits the data 
to the base station. The base station in turn performs the key 
task of cluster head selection. 

A. Cluster head selection 

Initially the nodes will transmit a hello packet to the base 
station. For calculating the shortest path, we can consider 
various Quality of service (QoS) parameters like spatial 
distance, processing delay, available bandwidth, allocated 
buffer space etc., in this phase of research we consider only 
spatial distance. Then, we used Dijkstra's p2) algorithm 
for finding the shortest path of the hello packet. After 
receiving hello packets from the nodes, using the Trajectory 
Clustering algorithm, the base station computes the repre- 
sentative trajectory by clustering the trajectories (here the 
trajectory is nothing but the path used by the node to transfer 
its data to the base station). The nodes of the obtained 
representative trajectory are considered as the cluster heads. 
Then the base station splits the network into clusters (equal 
to the number of nodes in the representative trajectory), 
and identifies the nodes in the representative trajectory as 
the corresponding cluster heads. Then, the base station 
broadcast a message to the network mentioning about the 
nodes and their corresponding cluster heads. Subsequently 
the nodes will use its cluster heads to transmit its data. This 
process will be performed periodically and the cluster heads 
will change based on the traffic. 

Cluster head selection routine contains the following stages :- 

1 . Base station computes the cluster heads using TC algo- 
rithm; 

2. Split the network into N clusters; and 

3. Broadcast message to all nodes mentioning cluster 
members and their corresponding cluster heads 

B. Trajectory Clustering 

The success of any clustering algorithm depends on the 
adopted dissimilarity measure. Following section explains 
about the adopted dissimilarity measure. 



Agrawal et al., p3) , proposed the usage of Euclidean 
distance between time series of equal length as the measure 
of their similarity. The idea has been generalized in | [34) for 
subsequence matching. In a similar way |35| used Discrete 
Wavelet Transform and |36J used Principal Component 
Analysis for measuring time series similarity. Another 
approach which is brought from image processing is Time 
Warping technique and it is used in |37| to match signals 
in speech recognition. A similar technique is used to find 
longest common subsequence (LCSS) of two sequences 
using fast probabilistic algorithms to compute the LCSS, 
and then define the distance using the length of this subse- 
quence 1 38]. In 1 39 1 suggested this technique to measure the 
similarity of time-series data in data mining. 

Here we adopted Hausdorff distance [401 for calculating 
dissimilarity between trajectories. The following are some 
of the definitions used in our algorithm. 

Definition 1: A trajectory (t) is represented as 
^}itid,uo,ui,U2--,Un) whcrc (tid) is a unique trajectory 
id (data packet), and (iio,Mi, 1*2,. .,"««) is a sequence of nodes 
reflecting the spatial position of the node. 

Definition 2: We define the spatial dissimilarity function be- 
tween two trajectories ti and t2 as the maximum of one way 
distances between two trajectories. 

The one way distance from a trajectory ti to another trajec- 
tory t2 is defined as the integral of the Hausdorff distance 
between points of ti to trajectory t2 divided by the number 
of points in ti (|ti|). 

distow(ti,t2) J dhip,t2)dp 

The Hausdorff distance from a trajectory point p to another 
trajectory t2 is defined as d{p,t2) — miriggtj {d{p,q)}.Ths 
distance between trajectories ti and t2 is the maximum of 
their one way distances. 

dist(ti,t2) = max{distow(ti,t2),distow(t2,ti)} 

Clearly the distow{ti,t2) is not symmetric but dist{ti,t2) 
is symmetric. Note that disto^{ti,t2) is the integral of the 
shortest distances from points in ti and t2- 

1 ) Fault Tolerant Trajectory Cluster Routine 

Trajectories are grouped into clusters using the threshold. 
Here the threshold is considered as a maximum value, such 
that all trajectories are grouped into a single cluster. The tra- 
jectory cluster routine contains the following stages: 

1. Dissimilarity matrix for trajectories will be computed 
using the Hausdorff distance, 

2. Using following Initialization Algorithm trajectories 
are grouped into initial clusters; 

(a) Take first sample as first cluster Classify all the 
remaining trajectories into this cluster if they are 
within the threshold. 



(b) Take a trajectory (sequentially) which is not al- 
ready classified into any of the cluster and con- 
sider it as a new cluster Take all the other tra- 
jectories which are not kept in any of the clusters 
and keep in this cluster if they satisfy the threshold 
limit. 

(c) Repeat step b till no new clusters are added. 

3. Using the following RepTraj Algorithm representative 
trajectories are computed. 

(a) For each Trajectory of cluster C calculate cumula- 
tive dissimilarity with all other trajectories of the 
same cluster C. Select the trajectory which is hav- 
ing minimum cumulative dissimilarity and take 
this as representative trajectory of that cluster 

4. By considering the trajectories received from step 3, as 
initial cluster centers, using the follwing Re-cluster Al- 
gorithm re compute clusters and their representative tra- 
jectories until there is no change in the representative 
trajectories. 

(a) For each Trajectory calculate dissimilarity with all 
the representative trajectories and classify to the 
cluster for which dissimilarity is low. 

(b) Re-calculate representative trajectories using Rep- 
Traj Algorithm. 

C. Fault Tolerant Routine 

To make the system fault tolerant, instead of selecting only 
one optimal representative trajectory, routine is asked to se- 
lect next p number of optimal trajectories, based on their pri- 
ority levels, and there corresponding cluster heads will be 
selected for communicating data to the base station. This list 
will keep by the base station, in any case of fault occurs with 
the initial cluster heads, as a choice it can go for other alter- 
natives. 

D. Data communication phase 

There are three steps during the data communication phase: 
data collection, data fusion and data transmission. Initially 
each sensor node transmits the sensed information to its 
cluster head at the time slot assigned by its cluster head. In 
order to save its energy, the node will close transmit part 
during the time slot which is not required to it. Once data 
from all sensor nodes have been received, the cluster head 
performs data fusion on the collected data and reduces the 
amount of raw data that need to send to the base station. 
Once the data gathering and data fusion are completed, the 
cluster head sends the compressed data to the base station. 

As mentioned previously, all the nodes can work as cluster 
heads. Due to this, any node can become a cluster head 
or a cluster member At each turn, the cluster head calcu- 
lates available power and compares with the cluster mem- 
bers. Whenever the cluster heads power becomes less than 
the minimum power holding, then the cluster heads inform to 
its cluster members and assigns the maximum power holding 
cluster member as the cluster head and the same inform to 



the base station. Whenever the cluster head is changed, base 
station repeats the cluster finding process and modifies the 
clusters. 

III. Experimental Work 

To evaluate the performance of the algorithm, it has been 
simulated and compared its performance with energy effi- 
cient clustering routing (hereafter, EECR). Before the simu- 
lation and results are introduced, the radio model and some 
important parameters |29J used in simulation have been de- 
scribed. 

A. The radio model 

We have used both the free-space propagation model and the 
two-ray ground propagation model to approximate path loss 
sustained because of wireless channel transmission. Given a 
threshold transmission distance of dO, the free-space model 
is used when d < do and the two-ray model is applied for 
cases when d > do. Using these two models, the transmit 
energy costs for the transfer of a b-bit data message be- 
tween two nodes separated by a distance of d meters is given: 

ifd < do, Et ib,d) = ET^b-^Ea,npid)b = ET^b+eid%-{l) 

ifd > do, Et (b, d) = bEBpb + e2d% (2) 

With regard to the energy cost incurred in the receiver of the 
destination node, we give in Eq. (3): 

Et [b] = ER^b (3) 

We have summarized the different meanings and values for 
energy terms in Table 1. Energy consumed during data ag- 
gregation in the cluster head E^^, is also taken into account. 

B. The number of clusters 

We assume that N nodes are distributed in the area of A*A 
randomly. If there are M clusters, then there are N/M nodes 
in each cluster on an average. Every cluster head receives 
the sensed data from its cluster nodes, aggregates all the data, 
and sends it to the base station. 

The total energy spent on transmitting a frame for every clus- 
ter head can be expressed as: 

N N 
El = bEr^— + bEdaj:^ + be2d\ (4) 

where di is the distance between cluster head and base sta- 
tion. 

In one frame, the cluster nodes transmit the sensed data mes- 
sages to its cluster head. The energy spent for each cluster 
member is as below: 

E2 = bETx + bsidl (5) 

where d2 is the distance between the member node and its 
cluster head. If the cluster head is in the center of the cluster, 
the density of every cluster is p— M/A2, then d2 can equate 
to 



Table 1: Summarizes meaning of each term and typical value 



S.No 


Term 


Meaning 


Value 


1 


Eda 


Consume energy for data aggregation 


5nJ/bit 


2 




Radio Electronics Energy 


50nJ/bit 


3 




Transmit applied for free space 


10pJ/(bit*m2) 


4 


£2 


Transmit applied for two way model 


0.0013 pj/(bit*m*) 



The energy spent for each cluster member is modified as: 

1 

E2^bET^ + bei — — (7) 

ZTT M 

The energy dissipation in a cluster can be expressed as: 



1 

E2 = hExx + bei (8) 

The total energies dissipated in all the clusters can be ex- 
pressed as: 



E = MEr 



2Et,N + EdaN + Me2d\ ^ 



(9) 



if, 



dE 
dM 



0, we can get the following Eq. (10) 



M = A, 



N 



£1 



2tt {e2d\ - Et.) 



(10) 



In our simulation, we consider N = 100, A = 100 and di 
= 90m, and for various number of clusters i.e., from six to 
twelve. 

1) Results 

It has been simulated that 100 nodes randomly located in 
the sensing field of 100 X 100 rn^ with the base station 
located at least 90 m away. All sensor nodes periodically 
sense events and transmit the data packet to the base station. 
All sensor nodes start with an initial energy of 2 J and 
the data message size is fixed at 516 bytes, of which 16 
bytes represent the weight value. We choose three different 
coefficients Ci = 0.5, C2 = 0.4 and C3 = 0.1. To evaluate the 
performance of our algorithm, we compare its performance 
with EECR. 

Performance is measured by the number of rounds alive and 
the total data messages successfully deUvered. Following fig- 
ures 1 & 2 shows the simulated results. It is obvious that 
our algorithm outperform EECR in the number of rounds 
the nodes alive. The nodes that remain alive in EECR are a 
maximum of 175 rounds, whereas with our proposed method 
rounds alive are 350 {see Fig. [TJ. If the system life time 
is defined as the number of rounds alive, with our proposed 
technique system life can increase 90%. Subsequently the 
number of packets delivered at the base station during the 
number of rounds of activity is increased from a maximum 
of 40000 (EECR) to 70000 (see Fig. |2|. 

C. Fault Tolerant system 

To design the system as fault tolerant, here the lists of cluster 
heads on their priority level are being provided; which are 
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Figure. 1: Number of rounds alive 
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Figure. 2: Packets delivered 



obtained from the next representative trajectories. This list 
will keep by the base station, in any case of fault occurs with 
the initial cluster heads, as a choice it can go for other alter- 
natives. Following Table 2 shows the list of cluster heads, the 
number of nodes handled by cluster head and their expected 
life time. 

IV. Conclusions 

In this paper, a novel Fault Tolerant Trajectory based cluster- 
ing solution is presented for selecting cluster heads in wire- 
less sensor networks. Trajectory clustering algorithm enables 
sensor nodes to reduce data packets by data aggregation. The 
wireless communication cost is decreased by reduction of 
data packets, and thus the clustering technique extends the 
lifetime by reducing the energy consumption of the network. 
The simulation results demonstrated that our proposal sig- 
nificantly improves the lifetime and reduce the energy con- 



Table 2: List of Cluster heads based on Priority 



Priority 


List of Cluster Heads 


No. of nodes handles by each cluster head 


Expected Life time (in terms of rounds) 


1 


0, 34, 69, 39, 33, 68 


28, 11,30, 15,7,9 


368 


2 


0, 34, 69, 39, 33, 68 


36, 6, 27, 15, 7, 9 


368 


3 


1, 68, 0, 34 


51, 14, 16,9 


351 


4 


0, 87, 34, 68 


60, 6, 11,23 


351 



sumption of wireless sensor networks compared with exist- 
ing clustering protocols. We assume that the nodes are error 
free. However, error will arise due to the noise in the real 
network environments. As a future work, we plan to extend 
the method to increase its robustness. 
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